page stuff 



page 1 stuff (DOS) 



two machiino language routines 
(from BCPLD/1 in load string) 



Nova debugger (may not 
be present) 



BCPL runtime routines 



statics 



BCPLO 
BCPL1 
BCPL2 



BCPL stack frames 



\|/ 



/\ 



Free storage 



.'Compiler overlay area 





#400 
,71000 



'1000 



'#4400 



'#5300 



'#620Q 



-#17000 



BCPL stack frames grov/ from here towards DOS. 



If ttiese two areas collide, the compiler aborts. 



Free storage grows from beginning of overlay area 
towards 0. Beginning of .free storage depends 
upon size of overlay co.de. 



•#64000 



COr/iPILER CORE ORGANIZATION 



LEX Core OrganizaUon 



CAE, SAE, TRN Core Organization 



Resident stuff 



Frame slack 



Nk 



DICTIONARY. #10000 word 
block vvfiich goes away after 
LEX pass. 



1 



Code for LEX 
(BCPL.YL) 



DOS 



-#17000 



-#54000 



-#64000 



Resident stuff 



Frame slack 



Nk 



CAE builds the. tree in free 
storage which grovvs yK 
towards 0. I 

SAE and TRN scan the tree 



CAE, SAE. TRN overlay area 
(BCPL.YC, BCPLYS. 
BCPL.YT} 



DOS 







•#17000 



-#52000 



-#64000 



The length of LEX code 
is remembered with A/X 
during loading. 



The Max lenglh{CAE, SAE, TRN) is remembered 

by B/X during loading. 
NOG Core Organization " 












Resident stuff 


-#17000 










Frame stack 


/ 






Free storage for dictionary 






and tables 

/ 


\ 


-#33000 






! 


#5000 word code block 


-#40000 






The length of NCG code 


Code for NCG 


• 


is remembered v/ilh C/X 


(BCPL.YG) 




during loading. 








DOS 


-#64000 



COMPILER CORE ORGANIZATION (2) 



FOR 






ptr to exp 


ptr to exp 


or ptr to exp 


ptr to statement 




SW!TCH0r4 


ptr to exp 


ptr to statement 




CASE 


ptr to exp 


ptr to statement 


(character number) 





CASETO 




ptr to exp 




ptr to exp 




ptr to statement 


• 


{character' number) 








DEFAULT 




ptr to statement 








REPEAT 




ptr to statement 






\ 


(LOCAL) + name 













REPEATWHILE/REPEATUNTIL 




ptr to exp 




ptr to statement 



I ~~~ 

I Return/Enclcase/Loop/Break/Finish/Aborl 



(One word statements are one word nodes) 



Control NocJes 
(figure 5) 



FNAP 








(Binary Operator) 


ptr to exp 


ptr to exp 


or ptr to exp 


ptr to exp 










COND 


(VECAP MULl DIF REM LSHIFT RSHIFT PL 
EQ NE GR GE LS LE LOGAND LOGOR E( 


ptr to exp 


Qualdescriptor: • 


ptr to exp 


• 


ft Names 


ptr to exp 


#100000+nanie1 




# subscripts on name 1 


SIZE/OFFSET 


ptr to exp 11 





tr 


Qualdescriptor 




ptr to exp 1n 


a 






#100000+nameK 


LEFTLUMP/RIGHTLUMP 


// subscripts on nameK 


ptr to exp 


ptr to expl.K 





H 
» 
B 


Qualdescriptor 

^ n 


ptr to expN,K 



IViore Expressnion Nodes 
(Figure 7) 



Line 



(char number) 



ptr to stat 



Structure 



ptr to structdef 



ptr to Stat 



Static 



ptr to stat 



// of names 

(zlabel/label) + name 
ptr to exp 



ext 



ptr to stat 



# of names 



(extlabel/zextlabel) + name 
O 



Let 


ptr to def 


ptr to stat 



Manifest 



plr to stat 



# of names 

(constant) + name 
plr to exp 



STATEMENT NODES 



(RGURB 1) 



Dof: 



AND 



ptr to line 



ptr to line 



VALDEF 


ptr to namelist 


ptr to exptist 






FND[:F 



or ptr to namelist 



ptr to exp 











or 



DEFNODES 
(Figure 3) 





LINE 




(character number) 




ptr to and/def 










\ 


(LABEL) + name 


/ 


/ 


\ 













(LOCAL) +;name 







1/ 



Explist: 



COMMA 



ptr to exp 



ptr to explist 



RTDEF 



or ptr to namelist 



plr to statement 







or 







Namelist: 



(LOCAL) + name 







NIL 







COMMA 



ptr to namenodo/nil 



ptr fo namelist 



Expressions: 

(Noto: Anywhere in the tree, "plr to exp" is only a pointer to one of the following nodes if bitO is 0. 
II a word with bitO = 1 appears where a "ptr to exp" is expected, the right hand 12 bits are the 
dictionary identifier for a name. This is done to minimize tree size. ) 



NUMBER 



value 



STF^INGCONSTANT 



length char 1 



TABLE 



length 



ptr to exp1 



ptr to expn 



CHARCONSTANT 



charcode 



NIL/TRUE/FALSE 



0/-1/0 



VAOLF 



ptr to statement 



LV/RV/NEG/MOT/VEC 



ptr to exp. 



Expression Nodes 
(Figure G) 



STRUCTDEFS: 



FIELDLIST 



# of fields 



ptr to struct def 1 



ptr to struct def n 



OVERLAYLIST 


# of overlays 


ptr to struct def 


1 


u 
n 
n 


ptr to struct def 


n 



UPLUMP 


ptr to struct def 


or ptr to exp 


ptr to exp 





RV ■ 




#100000 + name 








BLANK 




ptr to structdef/bitdef 








#100000 + NAME 




ptr to structdef/bitdef 






BITDEFS: 


BIT 




or ptr to exp 








BYTE 




0/ptr to exp 




# 




WORD 




or ptr to exp 



STRUCTURE MODES 



(Figure 2) 



S;tatemenls: 



SEQ 



ptr to statement 



plr to statement 



IF/UNLESS/WHILE/UNTIL 



ptr to exp 



ptr to statement 



COLON 



plr to namelist 



plr to statement 











ASJ 



ptr to namelist 



plr to exp 



RTAP 



ptr to exp 



or plr to explisl 



TEST 



plr to exp 



ptr-to statement (true) 



plr to statement (false) 



\ 



GOTO 



ptr to oxp 



RESULTS 



plr to exp 



(LABEL) t- NAME 







Statement Nodes 
(Figure 4) 



Font Conventions 



Italics represents file names: DCPLO. 

Hold represents procedures, manifests or statics: KeadSynif), Ch 
IJold Italics represent lexemes, tree nodes, and ocode: AND, VALDEF. 
Note: hont conventions do not apply to headings. 



Compiler Core OrjianizatJon 



Use pictures!! 
O..J77 



1000 

...~1000 

...~4400 
...~5300 
...~6200 

...-17000 



..64000 
..77777 



standard page 0. 1 stuff 

two machine language routines 
(from BCFLD/\"\n Toad string) 

Nova debugger 
(may not b'e present) 

standard BCPL non-time roiitines 
statics 



BCPLO (main program) 
BCPU n/O procedures) 
BCPL2 (other utility procedu.res) 



The BCPL frame slack grows ' 

from here towards DOS. 

// frame space and free space collide (during a 
procedure call, or a free storage block is allocated) 
the compiler aborts. 

Free storage space for each pass 

grows towards from the beginning 
of the code for the pass 

The start of free storage for a given 
pass depends on the size of its code. 

Code for the individual passes 
is read in just above DOS. 



DOS 





...-17000 
...-44000 

*...~54000 

...-64000 

...77777 



LEX Core Organization 
Resident stuff 
Frame stack 



Dictionary (#10.000 word free storage block 
whicfi goes away after LEX pass) 



Code for LEX (overlay fUs BCPL.YL) 
DOS 



* The slart of free space for LEX is (top of DOS) - (lengtli of LEX 
code). The length of the LEX code is remembered during loading with 
A/X. 



CAE, SAE, TRN Core Or<ianr/ation 



O...*17000 
...-17000 



'...-52000 
...-64000 

..nnn 



Resident stuff 



frame stack 



CAE builds the tree in free storacie, which grows 
from -52000 towards 0. SAE anci TRN scan the tree. 

Code for each of the CAE. SAE, AND TRN passes 
(RCPLYC, BCPL.YS, BC/'LYT overkiy files) is read 
in here at the beginning of the pass. 

DOS 



*The start of free space is (top of DOS),- max(!ength of CAE/SAE/TRN code). The 
maximum length is remembered during compiler loading with B/X. 



NCG Core Orj;anization 



Resident Stuff 

...-17000 

Frame Stack 

NCG alocates space for the dictionary, and for various 
tables, above the fixed-length code block. 

...-33000 ,, ,^ 

Code (#5000 word free-storage block) 

♦...-400000 _ 

Code for NCG (overlay file BCPL.yG) 

...-64000 

DOS 

...-77777 

* The start of free storage for NCG is (top of DOS) - (length of NCG code). The 
length of NCG code is remembered during loading with C/X. 



LEX 



InM^Jr- ti J ''"' ''^"^'i'sis. pass. It reads tlic source file (and any of its ■Vet" fi'es) and 
™ ./'/''^ f^"*;^^ ^^^^^"' "i'» '' stream of lexemes. The lexeme stream is vvr ttcn as i? k 
KffA' Tr ''" '^'."i^o^'-'^y fil-^ 555./?L (which will be read VcAtV LFX build an 
vS kdnn.''"r?'h''H-^'?'"'' '-'"^.^^^it^'? the dictionary onto the temporary Hie 5 A £5 
rnd r c^AP .f ^^^"^ fi'ctionary ,s used by NCG to output external name strings for 'the 
load.r, SAL also uses the dictionary to print undefined name error niessa"?s ) 




/^O'^: many of thes ^initi^s .lave bhs se in e ?f hSFlS^l^; ^ ^^^^^ 
/£^A' (wittut'th'feft Inirh^l^T '^'''"^^^ ''^r 1?'^"'' <^^rinlSiA Ik dupFicated" r? 
are also used as ocode operators, and appear in BNCGxZ v^dl ^^^/^^^> some 

Ihtr'^lhriFX^^^-^v^^ /fZ-O-ftit is called by the compiler's main program 

a.[cr me LtiA overlay Jilt {LCI L.YL) is read in and the free storage pointer is inilialtzed. 

KeadSonrceO 

1. Creates space for a few vectors used by LEX. 

2. Allocates a free storage block for the Dictionary 
(see below for dictionary structure) 

3. Opens the source file. 

4. Opens the temp file $$$.BL 

(See the description of OpenTenip. KcsctStream. etc.) 

5. Main Loop: calls Ucadsynib repeatedly, until the end of the 
source text is reached. 

6. Cleans up the source file line pointer table. 

7. Writes out the dictionary onto the temp file $$$.BD. 

The main procedure of the LEX phase is Reai!.symJ> in BLEXl. Each time it is called if 
outputs a lexeme to the f le $$$.BL (via LcxOulHexeme^) ai d rei.mi^ f ^vn„f h. ii 
the automatic insertion of semicolon^s and /;"'rwl e.??^ s' p .^^^^^^ fS- eu a£ 
IS set. so that the next call on Kcad.symb will simply output tile lexeme tmciused^^^ 
wS'require u'^ '° ^' '"^"'''- "^^'^^"^ ^'^^ °"^I^'^^ '^''^ inforn'a'tion'for'o,^? lexemes 

Ignorable (space, tab): ignore 

Digit: start of a decimal number; read it and output NUMBER lexeme. 

Capital. Small (alphabetic): start of a reserved word or identifier 

Default: cither a special char ("+". ">". etc) or illegal; branch on the character itself. 

^Rl%yn'^ If uf^ il""'"^ °[ letters & digits), it is looked up by Hcserve(lWord() (in 
If i^iUf"' .''f■/''''''''^''•''''°'"''/°^''f^^''^" set") the corresponding lexeme is output 
f It IS get . the file name is read and the file is opened. Otherwise the name is an 
Identifier, and is entered in the dictionary; the lexeme NAME is output 

Hrfr.';^,^^r!'''fh!'^'*-c'- '?^' ^O'l^f, ^^dc appropriate to the character is executed. For most 
as for "V- %ll^^'f LcxOutfappropnate lexeme . perhaps reading ahead one character 
as for > . Some characters, like "//" and brackets, require some extra work ''"'''''''^' .. 



Source Tile Handling 



sccueSu c '.actef ^ file name c,nd . character number wjthin the 

fmv... f -. Rememlv'r tim "^et" file input is intermixed with main source fi e inpuu. 
Lrlharyt-fUes'^nwfe netted.) 'tEi constructs the following tables for lh,s purpose: 

GetnameV (two words per source file, indexed by file 

number) 

GetlineV (three words per entry) 

GetnameV: ptr to name of filel. len of file 1 _ 

ptr to name of file 2, len ot tile /. 
etc. 

GctnameP contains the number of source files *2. 

An entry is made in GetlineV each time a "get" fi'e is opened or closed. An entry 
contains: 

sequential char number of last char read 
from the previous source 

previous source file number 

corresponding char number in the source 
file . j-i. 



that entry. 

This is what the procedure WritelJne (in BCPLO) does. 



To Add a Reserved NYord to BCPLx 
Choose a lexeme name and a byte value. 

The value must be one which is not used for any existing lexeme (BLEXX), tree node id 
{BCAhX) or ocode opcode {BTRNX). 

Define the name in BLEXX with bits bl, b2. el. e2, as follows: 

If the reserved word must start a statement (like "switchon", "goto"), set "b2". 

If the word might start a statement (like "rv" or "(" ), or if it begins a 
declaration (like "static"), set "hi". 

If the word might end a statement (like "]" or ")") set "el". 

See the existing definitions for more examples of v/here the bits. are used. 

Add the new reserved word to the ReservedWord procedure in BLEX2. 

Under "case S< initial letter>". add 
"( veq("< rest of word >" )?<Iexeme>. " 

and add an extra parenthesis to the "0))))..." at the end of the case expression. 

Now LEX will output the new lexeme whenever it encounters the new reserved word in 
the source text. 

Of course, you must define the lexeme in BCAEX, and add the appropriate code to CAE 
to parse and build the tree segment for the new slatement or expression. 

To Add a New Special Character or Character Combinations: 

Choose a lexeme name and byte value, and declare it as for a nev/ reserved word. 

In Readsynib in BLEXl, there is a long "switchon" statement whose "cases" are character 
constants. A "case" must be added for the new character, or. if one already exists it 
must be modified. Use "case$=" for a model of how to read ahead to check for a' 
character combination. 

The unused characters are currently: 

^ Heft arrow) 

(single quote) 

(reverse single quote) 
\ (back slash)" 

{,} (curly brackets) 

~- i~>) (A) (tilda. not, hat) 

I (vertical bar) 



Dictionary 
The first word of the dictionary is the current lefigth of the dictionary. 

The next 52 words of the dictionary are the heads of sorted lists of dictionary entries, one for ea 
upper- and lower-case letter, in the order a. A, b, B, . . . ,z. Z. Each list looks like: 



head 



ptr to 1st 'a' 



a: 



ptr to 1st 'A' 



n cl 
c2 c3 
{BCFL 
string) 



A: 



ptr to 1st 'b' 



BCPL 
string 





BCPL 
string 



The pointers are relative to the first word of the dictionary. The list is sorted algebraically on the 
vvord of the siring, then on the second, etc. (This is not quite alphabetical order, because the first 
has the length oflhe string in its left half.) 



Most Iiiiiiorfjirit Routines in CAE: Bcadhlockbodj;, Rcom, jlgxQ 
S'stSen?s"'''" "''"''' "^ ""'' "°^''='-' ""^P" '"^"^ "''''"' ^o'-responding to expressions 
"star nodes are constructed by Re.'.cI[,lockbody()/fOf£/ and by Rcom(Nextfn) (BCAES). 

RcmllmSoShodvf ^^^ ^ilL""'^ a declaration. is legal as the next statement. It calls 
KconHKc.uHjJockoody) if the next statement is not a declaration. 

Rcom(Rcoin) is used wiiere a declaration is not permitted (that is where a dedaraiinn if 
It appears, must be enclosed in brackets, as afte/ "if exp then . ''s.j;"^'^^^ ^'^^'^ration. if 

Rco.»(Nextfn) calls Nextfor(Nextfn) after processing; a statement which does not iff-^rr 
the class of statement to be expected. So ■Rcon)(Rcom) will parse risSuenS Sf 
non-declaration statements: Rcon)(Rend})lockbo,Jy) will parse a sS-iuence of sfSement. 
:;Si^^SS.^SS1s^^S'"''"'^ ca.^^.coinF..u,?r^c^i;oSJ? SS^e;!''?' "^"^ 

;S^£l^?S^rS.^^SSn^r^$^^cr'"^ '■"" *^ ^'^ '«^^eeer corresponding to the 



Names in the CAE Tree Structure 

Names appear in the tree in two different contexts: 

1. In a node which 

STATIC 

FXT 

MANIFEST 

VALDEF 

FNDEF 

RTDEF 

COLON 

FOR 

2. Wherever " — exp" can appear in a tree node. 
(That is, a use of the naitie as an expression.) 

1. Wherever a name is declared, two words are allocated for it; either in the 
declaration node itself {STATIC, EXT, MANIFEST) or as a separate block pointed at by 
the declaration node {VALDEF, ENDEF, RTDEF, COLON, FOR). The first of these two 
words has the name's dictionary pointer (relative address of the name string in the 
dictionary) in the right-hand 12 bits, and the data type of the name in the left-hand 4 
bits.. The second word is used for various purposes, depending on the type. 

2. Wherever a name is used as an expression, the tree node word which would 
normally be a pointer to the expression node is set to '7/100000 + name"*, that is, to the 
dictionary pointer for the name, with bit + 1. [Since a tree node pointer always has 
bit -- 0. pointers to exp nodes can be distinguisned from names by bit 0. When SAE 
encounters a name (bit 0=1) where it expects a pointer to an expression node, it looks 
up the name in the dynamic symbol table, and puts a pointer to the definition node in 
its place. See the "Dvec" documentation.]! 



LINE nodes 



In 

V 



mrd°of ^Ifw/y^r'"' '1 ""'•^''^" ^Pf'f''"'"' *^^ P«'"^" ^^'" PO'"^ «t a UNf: node; the third 
comDif^ n nHn^^h'i'l'i'' points at the actual statement node. The /.//V/- nodes allow the 

oiuputs a line of source text, given the (char number) from a LINE node 

U,\t nodes may also appear in AND nodes {AND nodes appear only under LET nodes.) 



SAlv. Dccllabels and Declvars 
These are the two top-level routines in SAE; botli in BSAEL 
DoclbbeLs (address of a tree node) 

scans U,e tree brand, looUin. for fO^V „<*s ca,,h,gM^^^^ t:^!^:^^^^^ p^. 

are defined by Mich nodes. lH'C>lf'». ) i' "^",S^„\°y \'„on'^ >iod<= C» """s "hK^'i '""= 

S's.a?rf S KrXcr;lini'"at" 's^bSnU, branch or the tree. 

Declvars (address of a tree node) 

scans .he uee branch. It processes ^ ^^^;;^j:;^'%SS"hX£T'::^^^ '^p^^^ 

So the real work of SAE is done by Declvars. 



The 
label 



SAE: Dvec, the DynniiiJc Symbol Table 

Dvcc points at a vector allocated by the SAE driver DecIareNanies ?n BSAEO. There are 2 words 
entry (l)vecN = 2) and 512 entries max: (DvccMax = 512). There are three global indexes into D\ 

DveclDvccS is the first unused entry. 

DveclDvecE is the first entry made for Uie declaration node bein!^ processed. (Thai is the first 
mcomnlete entry.) So searches begin at Dvec!(DvecE-2) and proceed backwards 
towards Dvec'.O. 

nvec!l)vecP is ihe/irst dynamic variable for tJie current function or routine. A referenced dynaiv 
y^riable (defined with let or as formal parameters) must be between Dvec'DvccP am 
Dvec DvecS, since dynamic variables cannot be referenced outside of tlie procedure 
which they are declared. 




ie<^ 

There are two kinds of entries in Dvec: normal names (variables and manifests) and .structure nar 
Normal name: Dvecli ik lOOOOO + name 

Dvec!(i+1) -~- (type) + name 

type-dependent extra word 
"name" means the dictionary pointer assigned to the name string by LEX. 
#100000 distinguishes normal names from structure names. 

XvV"'^^^r^,ffl'',^,~V^.r'lf*°''?^^'" ^^ l'^^ ^^''^ f'"^^ ^^'Oi'ds allocated by the name. (In STATIC 
^'AV'^^')" , ,-/(/''^^."°^'''''^' ^'^"^ ^'■''^'"^'•'^ '""-'" ''i*^' "Oi^*-' i'^'^'f^: »" VALDE!\ I'NDEF IITDFI' 
COLON i\m\ I Oh nodes, there is a pointer to the two-word block.) A dictionary pointer' 
may be 12 bits wide; the high-order 4 bits indicate the data type of the name The nossibl 
types are: * ■ 

LOCAL. dynamic variable (defined with "let name=val" or as a forma! paramete 

CONSTANT: manifest constant 

LAin-L 

ZLABEL 

EXTLAIIEL 

ZEXTL-iBEL (all state variable types.) 

J NT LAB EL ^^ ' 

ZINTLABEL 

A iKinie is declared as a static variable in an external declaration and/or in a static 
declaration, function, or routine declaration (let F(a,b)— ). or statement label ' 
(L:— -). In external and static declarations, a name may be preceded by "@" 
indicating that it is to be a page-zero static. ' 

If a name appears in both an external declaration and one of the other kinds of sta 
declaration, its type is INTLABEU or ZINTLABEL if it was declared to be in iia^e 
zero. * "^ 

V i,^!.?.'"V^;'JE'P^'"'^j^^?v^^A'f.'""''' c^'-^claration. but not in another declaration, its type 
IS tXl LABEL or ZkXI LABLL 

If a name is declared as static, but not as external, its type is LABEL ov ZLABEL 



'I"[ie second word of tlie two-word name r^ode contains: 

LOCAL 0. to be set to the frame offset by TRN 

MANIFEST CAE leaves a pointer to the expression in this word; SAE evaluates 
it and replaces the expression pointer v/ith the value. 

other A static number is allocated ( ) and^stored in this word. This 

number is used in the Ocode to indcnttfy the variable. 

Structure names in Ovec are intermixed with normal names. A structure name entry looks like: 
Dvecti name _ 

Dvee!(i +1) ptr to structdef. 

(Structure names can appear in the tree only in certain contexts (as the right-hand op^J^ndoJ 
rrff/ulnn^^^ SIZE, and OFFSET). So normal name searches (via CeUNvithNan e) 

ignore nodes with bit 0^ 0. and structure name searches (with StrutWiU.Name) ignore normal 
names.) . 

Onlv ton-level structure names are entered in Dvec. The second word of a structure name entry is a 
pcXterU, the structure node in which the name is defined, .f h'-'^. "ock can be a 

FIELDIJST. OVERLAYUST, or UPLUMP node, or a two-word node which looks like. 

#100000 + name 

ptr to structdef/bitdef 

(a "bitdef" node is a BIT, BYTE, or WORD node.) 

The procedure DeclSfruct in BSAE2 creates structure entries in Dvcc, and scans the structure 
definition for reasonability. 

The procedure LookatQual in BSAE4 processes structure references. 



SAE Ocode 

SAE must pass informaiiori to the code generator pass NCG about each static variable it defines S 
SAL opens the Ococle temp file SSS.BC, and outputs the following: 

-^f or names of lypc: LABEL, ZLAIiEL, I/yriAnEL, ZINTUBEh 

one byte: LOC/ZLOC/INT/ZINT 
Iv/o bytes: dictionary pointer 
one byte: "value type 
two bytes: "value" 

The "value type" word and "value" depend on how the name was defined. • 

a. If defined by static <nanie>=<exp>, 

value type = static 

value = value of <exp.> (it must be a ?-time constant) 

b. If defined by let <name>=<formaIs> . . . (procedure) 

value type = ENTRY 

value ^ a sequential number allocated for each ENTRY ov LENTRY, 

c. If defined by <name>: (statement label) 

value type = LENTRY 

value = a sequential number allocated for each ENTRY OK LENTRY. 

-^For mimes of type EXTLABEL or ZEXTLAIiE/^ 

one byte: EXT/ZEXT 

two bytes: dictionary pointer 

That is. external names which do not appear in a static, function, routine, or lab.. ! 
declaration have no value. The Ocode information for such names is output only if the 
name is used in some expression. 

When an SAE Ocode item is output for a name a sequential number is assuned to that name Wh( 
IRN later outputs an Ocode item which refers to that name, it identifies'^the static name v/ith Ih 
number. So the order in which sialic names ore output by SAE is important. 

For procedure and label names, another sequential number is assis;ned by SAE. When TRN lat 
generates an Ocode item for a procedure entry point or a label dcfinrtion, it ouputs both the sequent! 
static number (identifying the name) and the sequential entry number (identifying the entry point) < 
that NCG knows what entry point to assign to each name. ' " 

The SAE Ocode is scanned by Scanlnijiure.s in BNCGL 



TRN scans the Iree and outputs tlie remainder of the Ocode stream. The principal routines are 
Trans in BTRNI, which processes statement nodes; and Load in BTIiNS, which processes expressions 
which Trans encoinUers m the tree. 



Lo;i(I (pointer to expression node): 

Generates the appropriate Ocode to place the value of the expression on top of the sinu,?atcd stack 

/.A' value diet name for a manifest name 
LP frame-Ioc diet name for a dynamic name 

LL static number diet name for a non-page-zero static name 

/.Z static number diet name for a page-zero static name, 
and increases SSP by one. 

ki,S^f'K irir'"^"" '''' ''^ ' = '• '^ P°'"^^ '' ^" ^^f'^^^^'^" "°^-. -o l.o:ul branches on the 
1. For binary operators; Load basically does 

Load (left-hand expression) [which increases SSP] 
Load (right-Iiand expression) [which increases SSP] 
output binary operator 
decrease SSP 
return 
These operators are: 

l:Q Nl- LS IE GR GK 

MULT DIV JiEM LSI I! FT RSHIFT 

PLUS MINUS LOGAND LOGOR EQV NEQV 

VECAP 

a mnS""nf 'h^,^'"'''^' «Pf^'^^"^s -'"tJ ^or relations, the right-hand operand is output first if it i. 

2. Other simple nodes: 

NOT,NEG,RV : Load (operand) 

output NOT, NEG or RV operator 
TRUE, FALSE ; oi,tp(,t yy^iy/.- /-vj/^^'^ operator 

NUMBER : oinpia A/V operator 

output binary value (2 bytes) 

[M' expects the name of a manifest after the value.] 



CHARCONST : output AC operator 

: output char code (2 bytes) 

STRONCCONSr : output mVv' operator . ,. , . 

output byte stream: length (1 byte) 
char 1 (1 byte) 

char n (1 byte) 

■[■y\lilE ' output y/f/?/./; operator 

output table length (2 bytes) 

output table values: word 1 (2 bytes) 

word n (2 bytes) 
3. LV Load calls LoatlLv . " 

The operand of an Iv-expression must be a name, an rv expression, a vector subscript expression, a 
structure reference, or a conditional expression. 

^ ■ ■ 

-> For names, LoadLV outputs: 
dynamic variable: /X/* frame effect diet name 

non-page-zero static: ILL static number diet name 
page zero static: /ZZ static number diet name 

-^r^^or vector subscripts, _ "; 

Loat! (left-hand operand) 
Load (right-hand operand) 
output PLUS operator 

-♦For rv-expressions. 
Load (operand) 
output LFAT operator. 

(This is done because "Iv (rv x)" is not equivalent to "x" if bit 1 of "x" is set during execution. 
NCG generates a subroutine call to check this, and follow the indirect chain of x .) 

->For structure references. LoadLV processes the field qualifier, and then generates Ocode to divide 
the bit-offset of the field by 16, if necessary, and to add in any remainmg constant word-oftsel. 

->For "lv«exp>?<expl>.<exp2»". LoadLV generates Ocode for ••<exp>?lv<expl>,lv<exp2>". 



4. VALOF 

Load calls Trans to process tlie statement operand of VALOF. A "resulls" statement in this block 
vv.li cause Intns to output a Ococle operator which will in turn cause NCG to compile a iur 3 to an 
in erna abe . After IVans has processed the statement. Load assigns this label (outp uttfni /iV 
in ernal abd number) and then outputs A\S7V1C:a- stackpointer. which indicates to NCG that a result 
value IS to be found (m AGO) at this point in the program. 

5. COND 

Load generates Ocode for "<exp>?<cxpl>.<ext<-p2>" as if it were "valof [test<exp>then 
resulis<expl> or results<exp2.J ^ 

For example, "x Is 0? -x, x" outputs the following: 

fC, ^ (load dynamic var "x") 

{'{^ (load constant "0") 

'"> (less than operator) 

{'•'''bl (if false, jump to internaliabel) 

('/; X load "-X" and 

RnSlabl (preserve result and jump to end of condition) 

LAh labl (assign first label here) 

f'J,^ (loacT "+x") 

A7:5'lab2 (pre^.erve result and jump to end of condition) 

ni-rl^^]^ (assign result h»bel) . 

hSrACK p (push a value onto the simulated stack at 
stack portion p) 

6. Fl^AP 

Load calls Transcall in BTRN3 

Transcnll also handles l^T/^^JpvJvan^. FNAP and RTAP arc different only in that //Y/fP anpears in 
an expression context, hile /?f/lP appears in a statemenl context. ''I>pcari, jii 

Ocode: FNCALU RTCALL (operator) 

For each parameter: 

Z,on</(p'Tr'i meter i) 
PARAM (operator) 
parameter number (one byte) 
total # of parameters (one byte) 
diet name of procedure (two bytes, 
if not a name) 

Load (procedure expr) 
FNAP/RTAP (operator) 

7, Structure References 



total # of parameters (one byte) 

stack position at beginning of call (two bytes) 



Structures 



CAE 



It builds 

es. 



SAE 



TRN 



NCG 



RstrucKread overlay switch) in BCAE4 parses structure declaration It bi 
striictdef nodes •- see the tree node templates for the format of these nod 

Rnu-ilnamernumber of extra words in node) in BCAE4 parses struclure references. It 
bui ds f node clEafnins tiK^qualifier descriptor w^lh the s^^^^^^ number of words 
at the beginning (2 for SIZE, OFtStT. 3 for LEFT/ HIGH fLUMF)- 

neclvnrs in BSAEI calls DeclStruct in BSAE2 to scan structure declarations It declares 
top~kvtVnam^^^ nvec.evaluates replication and size constants .n the declaration, and 
replaces RV (@) names with pointers to the defining structure. 

lonior in /?^,^F?rnll'; 1 ookatOital in BSAE4 to process LEFTLUMP. RfGUTLUMP. SIZE. 
non-consta;it sJbSipts. and constructs a representation of the field-access algonlhm. . 

DoOual in BTRN3 optimizes the algorithm given by LookatQual and generates the basic 
Ocode for the structure reference. 

The routines in A\VCC75 handle most of the structure Ocode items. CCplus in BNCG7 looks 
ahead for some structure Ocode items for optimization purposes. 



BSAE4 



LookatQual (addr of start of quahlesar in LEFTLUMP, RIGIFFLUMP, SIZE, OFFSET oco6o 
(see the template for these tree nodes) 

This procedure creates a vector (in the tree space) ^pcTi^^ing the rcfc^ and puts a 

pointer to it into the node (node!3 for LUMr$,.nodd2 for SIZE and OIISLO. 

This vector has the form: 



0: offset (in bits) after the last non-constant subscript 

1- size of field in bits 

2: number of non-constant subscripts 

for each non-constant subscript: 

offset (in bits) to this subscript 
-vexp for subscript 
->(low limit value, high limit value) 
size of replicated element 



(Offsets for constant subscripts are computed and added in at compile lime.) 



So the algorithm for computing the bit number of the first bit of the accessed field of: 



K 



Ki 
->X 



Si 



I I 



i = 1 to n (n non-constant subscripts) 



is K + 2(Ki + (X - L )*Si) 
i i i 

(This is a run-time algorithm, since the X are not constants.) 

i 

DoQual in BTRN3 optimizes this expression as it generates the Ocode for the structure 
reference. 



liTRNS l)&Qiia!(inputdescriptor. outputveclor, wordsubscriptswitch) 



inputdescriptor is the address of a vector generated by LookatQual in DSAM. 



word 0: K (final offset) 

1: S (size of referenced field) 

1: n (no. of non-constant subscripts 

3-^3+(4i-n) : K (offseti) 
i 

— >e-xp (ith subscript exp) 

i 
— >(L .11 ) (subscript limits) 

i i 
S (size of replicated field) 
i 



outpulvcctor 



is a four word block for liie final Ocode in formation. All preliminary 
Ocode needed for the compulation of the non-constant subscript offsets is 



generated. The oulptitvcctor contains: 



woid 0: Ocode operator 

1: constant part of offset computation in words 

2: constant part of offset computation in bits 

3: size of final field in bits. 



wordsiibscriptswitch 



if false, the subscript computation is carried out in terms of bits; if true, as 
much as possible is done in terms of words (bits/16). (The "bits" mode is 
used for OFFSET nodes.) 



The possible Ocode operators in outpulvector'.O are: 



WQUAL :thc field is a full word or a part of a single word 
XQUAk the field is < 16 bits wide, but overlaps a word boundary. 
M\V<2UAL the field is n>l words long, and starts on a word boundary. 

(The only non-constant subscripts are on elements which are.I6*n bits wide) 

YQUAh the field is 8 bits long and starts on a byte boundary 

W'HQUAL the field is <16 bits wide, and a non-constant bit subscript is present 

MWnQUAk the field is'>16 bits wide and a non-constant bit subscript is present 



WQUAL 



Load (wordaddress) 

wordoffset 

biloffscl (0-15) 

length (bitoffset + length <16) 



XQUAk 



Load (wordaddress) 

wordoffset = 

bitoffset (1-15) ■ ,^^ 

length (<16; btit (bitoffset+length)>16) 



YQUAL 



Load (bytepointer) 
wordofrset~0 
bitoffset=- 
!cngth=i 



MIVQUAL 



Loacl(worcladdress) 
v/ordoffset 
biloffseuO 
Iengtli:.16*n n>l 



MWBQUAL 



Load fwordaddress) 
Load (bit offset) 
vvordoffset--0 
biloffset=0 
length (<16) 



MWBQUAL 
Load 



Load fwordaddress) 
Load (bitaddrcss) 
\vordoffset=0 
bitoffselr-O 
length (>16) 



OCODK (output by TRN) 

S is the sitnulated strick. Ocode operators arc one byte. Arguments to Ific following operators are 
two bytes, left followed by right, unless otherwise indicated. 



LINE charnum 

PUNE charniim 



Inserted in the Ocode before most statements, so that NCG can print the 
corresponding source code on errors or on the ASM listing. jPL/yV/i' appears only one 
for each line of source text; LINE appears for each statement on u source line. 



LP frame position 0/name 

Push onto S the value of the dynamic variable allocated at this frame position. 
LN binary value 0/name 

Push the binary value onto S. ,, 

LL & LZ static var number 0/nanie 



Push onto S the value of the static variable whose number is given. {LZ=> it is a 
page zero static). 



LLP frame position 0/name . 

Push onto S the address of the dynamic variable. 
LLL & LLZ static var luimber 0/name 

Push onto S the address of the static variable. {LLZ-^ page zero static) 



^'^ binary value 

I-iish the binary value onto S. 
f-STR <byto stream > 



TABLE <doublc-byte stream > 



TRUE 



Push the value iniTlll onto S. 



FALSE 



Push the value onto S. 
LLVP frame offset 

S."af^^J'^.Jl^:Sl^^^!S '""^ "f ^^'^'^'^ '^ <^-"- °^^-^> -ords below th 
NEW LOCAL name 

Declare a new dynamic variable v/Iiose frame position is the current inn nf c; tf,„ 
value currently on the top of S is the value to be stored in the variabig ^^ ^''^. 

"let xnlO" generates /,A' 10 



NFJVLOCAL (name of x) 
"let v^vec 10" generates . IIVP <??> 

NEWLOCAL (name of v) 



SP frame offset O/name 



Store the value at the top of S into the dynamic variable whose frame offset 
is given. Pop S. 



SL, SZ static var number O/name 



Store the value at Ihe lop of S into the static variable whose number is given. 
(5'Z=^pagc zero static) Pop S. 



RV 



Treat the value at the top of S as an indirect address word. Replace the top 
of S with the contents of the memory address at references. 



VECAP 



Treat the sum of the top two values on 5> as a memory address; replace those 
two values with tlie contents of that, memory address. 



STIND 



Treat the value at the top of S as an indirect address v/ord. Store into the 
location it references the value at (top-1) of S. Pop S twice. 



STVECAP 



Treat the sum of the top two values on S as a memory address: store into thai 
location the value at (lop-2) of S. Pop S three times. 



Lnvii 



I ruit the value at the top of S as an indirect address word. Replace the tor 
of S with the memory address that would be referenced by the indirect add r 
word. (i.e.. follow the indirect address chain.) ^ '"uirccc acicir 



■'x=rv y" 



"x=y!i" 



rv x=y" •■x!i=y" x=lv(rvy) 



LPy 
RV 
Sl> X 



LPy LPy LPy 

LP\ LPx LPx 

VECAP S'flNl) LP i 



LPy 

LVRV 
SP X 



PLUS 

MINUS 

MULT 

DiV 

REM 

LSUFT 

RSHIFT 

LOGAND 

LOGOR 

HQV 

NEOy 



LS 
LE 
GR 
GE 



Binary operators: replace tlie top two elements of S with the value 

of the onerator applied to their values. (The top element is tlie right-hand 



A? rdS iffi^'^l^ViJ^f^dS.'^'""'"^^ ^' ' '^ "" '"''''' *^"^" '' ^'- 



NEG 
NOT 



Binary operators: replace the top element with the result of the operator. 



JUMP interna! labc-l number 



Transfer control the place where LAr> <internal label ni!mber> appears in the Ocode. 



JT 



internal label number 



Transfer control if tlie value at the top of S is non-zero, i'op tlie value off S. 



//' internal label number 



Transfer control of tlie value at the top of S is zero. Pop the value off S. 



RES interna! label number 



Transfer control as for JUMP, after preserving the value at the top of S (i.e.. load it into 
ACO). (Generated by •*resuUs<exp>") Pop the value off S. 



LAB internal label number 

Assign the place to which the above should transfer control. 
(L/i/J'-s- are also output for "case" labeLs.) 

GOTO ... J 

The value at the top of S is the address to which control should be transferred. Pop S. 



FINISH 
ABORT 



End execution of the entire program. (On the Nova, do a .RTN system call.) {ABORT types 
a message on the terminal.) 



SWITCIION 



number of cases default label number 

case value case label number 



1 



1 



case value case label number 

n 



n 



(2+n*2 parameters, each 2 bytes) 



The value at the top of S is the value to be branched on. If this value is amon^ the ^r-*.;.^ 
valuc^> values, transfer control to the corresponding <case label number >fotfervvfse! 

tr^'uisfer control to the <dcfault label number>. Pop the value, off S. ^ 



(/,///; ocode operators are output by TRN at each case label.) 



LEN'riiY static var number entry point number 

5wS"' '"'''"' '" '^'''^'"'^'"^ '''^^■' 's encountered. Its two parameters are like the first two of 
KNTRY static var niunber cntrypoint number jMaxSSP MaxVccSSP 

The static variable is defined as a function or routine which beiMns here The CcnWv nriJnf 
ourpjrby-lAg"''' ' '"''"^ ''"'"' '"''^'^ ''•' '''''' '''''^' Ococie^information uS S"' 

MaxSSl* is the size of the frame which will be needed by the code for this procedure 
SAVE initial frame size 



This Ocode generator appears only after an ENTRY opcrsKn. Its parameter is the number of 
fornial paranielers plus the niiniber of words needed for the frame header. 



NUMARGS name 



If a "numargs" variable was specified after the argument list, this Ocode operator causes it to 
be declared. 



RTRN 



Generated by "return", and at the end of a routine body. Causes a return from the current 
routine. 



FNRN 



Generated at the end of a function body. Causes a return from the function, with the value 
at the top of the stack S as the result value. 



ENDFRAME 



Marks the end of the procedure body begun by the corresponding ENTRY. 



RTCALL 

I'NCALL 



Sienals the beginning of a procedure call. {RTCALL \f standing alone as a statement. 
I'NCALL if used in an expression.) 



PARAM i(one byte) r>(one byte) 0/name(2 bytes) 



The value at the top of S is the ith parameter to be passed to a procedure. A total of n 
arinimcnls are to be passed. The name of the procedure to be called is given as the third 
parameter to PARAm\ (if the expression before the argument hst is not a nanie, is used). 



RTAP 



J'NAP 

n(one byte) p(2 bytes) 



The valiie at the top of S is the nddress of the first instruction of the roiiiine or function to 

DC Cti t JCCI. 

n arguments are to passed; the argument values are the vahies in (top-1) to (top-n) of S. 

p is the offset of the top of S at the place in which the RTCALL or /WC^ILL appeared fit 15 
included here for checking only). hi ^i u-- >^ 

If the operator was FA'AP, the value of the function call is left on the top of S. 

STACK stack lop position . 

Force the top of S to be at <stack top position>-l. 
liSTACK stack top position > 

Force the lop of S to be at <stack top position>-l; then Push onto S the value that was 
preserved by the RkS {hixl was just executed. 



RES labnurn is generated by "results". At the end of a "valof" block, the <labnum> 
IS assigned (with LAB iabnum). The next Ocode item after this LAD is RSTACK. 

So A'^W forces a value into ACO. generates a jump, and then forgets about the value it 
loaded. RSI ALA pretends that some value has just been loaded into ACO. 

The purpose pf STACK {^m\ RSTACIC) is to indicate wliere the simulated stack top should be 
at the beginning and end of blocks; that is, it appears after' declarations, and afier the last 
statement of a block that began with a declaration. Basically, then. STACK (and RSTACK) 
are used to keep fRN and NCG synchronized v/iih respect 10 wliat ihe value of "SSI*" 
should be at critical points. Certain other Ocode items are also accompanied by an "SSP" 
value for the same reason. 



STORK 



This Ocode operator is output in order to force NCG to generate code to store into actual 
frame locations any values on the simuhited stack which have not been fully processed. 



57'0A*Zf should not really be an Ocode item; NCG should know when it is necessary 
to clean up the simulated stack. 'i"he operator is left over from TX-2 days. 



ar};vec is used as a stack witli five words per entry. Each entry describes an item on the 
simtihited stack; that is, a riin-tirne operand. The NCG variables arjjJ. nrgl. and ar<',3 
always point at the top entry, the entry next to the top. and the entry below that, 
respectively; when an entry is pushed or popped, these pointers are changed. So tii\\h arg2. 
and nr<.',3 are pointers to the three most recent run-time operand descriptors which have 
been processed. 

An ary,vec entry consists of five words; if arg is a pointer to some nrgvec entry, its fields are 
referenced with type!ar<;, loclarg, rcflarg, poslarg, and iianielarg. 

type!ar;j indicates what kind of operand is being described; 

loc!ar^ depends on the type. The possible types, and the meaning of the corresponding loc's 

are: 



LOCAL frame offset 



Word <frame offset> in the frame of the current procedure. 
This describes both dynamic variables and temporary cells 
used in the frame. 



LABEL static variable number 



Static variable. The <static variable niimber> is the number 
used throughout SAE. TRN. and N'CG to identify, the static. 



COMMON static variable number 



NUMBER binary value 



RVLOCAL frame offset 



Page zero static variable. The <stalic variable number> is as 
for LABEL 



Constant with the given value. 



Describes an operand which is to be referenced by 
indirection through the <frame offset>-- word of the frame- 
This descriptor would be generated, for instance, by an 
operand "rv x", where "x" is a dynamic variable. 

RVLABEL static variable number 

Describes an operand which is lo be referenced by 
indirection through the static variable corresponding lo the 
<stalic variable nuinber>. 

RVCOMMON static variable number 

Like RVLABEL but the static is a page-zero static variable. 

RVNUMBER binary value 

Results from "rv n", where "n" is a constant expression. This 
operand is to be referenced by indirect through a memory 
word v/hich contains the <bjnary value>. 



LVLAIUIL sialic variable number 

Describes an operandi whose value is the address of the sialic 
variable. Results from "Iv s", where "s" is sialic. 

LVCOMMON static variable number 

Like LVLABEI^ but the static is a page-zero static variable. 
AC 

Describes an operand which is in AGO or ACl. 
A7e 3 

Describes an operand which is in AC3. 

Temporarily ignoring the other fields of an ar-^ let us look at an example of how NCG 
processes a statement. 

Assume that p (i„ .vcud 6 of the frame) is a dynamic variable and s (static var number 13) 
IS a static. I hen the statement ^ ■''' 

s = rv p + 2 

would generate this Ocode: 

LP 6 

RV 

LN 1 

PLUS 

SL 13 

NCG proceeds as follows: , % 

LP 6 



RV 
LN 2 

PLUS 



^♦Jl^ses the descriptor LOCAh 6 to be pushed onto arj^vec. So argl points al 

is a unary operator, so it is to be applied to the operand described by arvj 
I his changes the descriptor in arnl to KYLOCAL, 6. .' 

causes the descriptor NU.MfJKR, 2 to be pushed onto nrgvec. So now: 

arg2 "> RVLOCAL, 6 . 

argl -> NUMBER, 2 

is a binary operator, so it is to be applied to arg2 and nrgl. This will cause 
the code neneralor to generate the necessary code to load the operands 
described by ;trg2 and argl into free accumulators, and then generate an 
ADD instruction: ■ 

LDA @6,2 //rv of frame word 6. 

LDA3.+ K1 /A+Kl will contain a 2. 

f^^^^J 3.0 //add. leave, result in AGO. 

(in the process of generating this code, argj and aig.2 are modified several • 
times; v/e will ignore this in this exam[)le, since they will po avvav 
immedialeiy.) .- c j 



SL n 



After t[)is code is generated, ai<!vec will be popped twice, and then the 
descriptor AC. will be pushed. So now 

ar^2 -> ? 
arjil -> AC, 0. 

indicates that the operand described by i\\g\ is to be stored into static 
variable #13. So NCG will, upon encountering this ocode item, push a 
descriptor LABEL 13. giving: 

ar<;2 -> /fC 
argl -> LABEU 13 

and then generate code to store arg2 into the operand described by arjjl. (If 
an'2 were not in an accumulator code would just have to be generated to 
lead it.) So 

STA @.+K2 . //.+ K2 will contain 

//the address of s 

Now argvec is poped twice, and we are done. 

Note that no code was generated until the PLUS was seen, because the operands up to that 
point could be described by ari>vec descriptors. One of the purposes of the argument stack 
IS to allow the generation of code for an operand to be postponed as long as possible. 

The name field of an argvec entry contains the dictionary jpointcr for the name of the 
operand, or if it is not a variable or manifest. The name Held is used only to generate 
meaningful comments on the /A listing, if it is requested. 

The ref word in argvcc entries allows more complex operands to be described in ar<^'vec so 
that code generation can be postponed even further. If arg is a pointer to an argvcc entry 
rcflarg is interpreted as follows: 

if rcflarg is 0, the operand is as described above. 

if bits O-il of rcflarji are 01, the operand resulted from a vector subscript 
expression; where one operand, of "!" was a constant between -#200 and +#177. 
'I he right-hand 8 bits of rcfjarg contain the constant subscript. For example, 

LOCAL loc = 6, ref = describes an operand 

which is in word 6 of the current frame; it is loaded into ACQ with 
LDA 0. 6. 2. 

LOCAL foe = 6, ref = #40007 describes an operand v/hich is the result of an 
expression like "v!J", where *'v" is the dynamic variable v/hich is allocated to 
word 6 of the frame. This operand is loaded into ACO with: 

LDA 3 6,2 

LDA 7.3 

if bits 0+1 of rcf.'arg are 11, the operand resulted from "rv" applied to a vector 
subscript expression where one operand of "!" v/as a small constant. 

LOCAh !oc = 6. ref = #140007 results from "rv(v!7)", and is loaded into 
ACO by 

LDA 3 6,2 

LDA ©7,3 



The only legitimate values for a lef field are: 

normal addressing 
#40000+n subscripted- reference: 



n IS an 8-bit signed value, 
so the ref field for "v'-V" 
would be #40371. 

#140000+n rv of a subscripted reference. 

^^^r^T^:i^!,£^,;gS-^^l^^^ ^direction, is only set if 

may be 0. E.g. the Sf fliSd for 'v!{y"'i'; ^moo!' '""^ ^'' ^ '^^ '^ '^'^^ " 

tJoZt%' ^Ill^r ''"^ ""^ "°^ "^'^^ ' "— ° -^^ f-'^- '^ a VECAi' Ocode operator is 

M-yp -* AQ !oc = 0. ref = 

ar<il -> NUMBER, loc = 7. ref = 0. 

MOV 0, 3 

then pop ihz two arj-s and push XR Inr - ^ r^-f y/ mnm ir .i • 

into At 0. it wiir be dorle with " ' ' *'^°°^^- '^ ^'"' ^^^^'"''^"^ '^ to b^- Joacled 

LDA 7.3 

The Ocode for: x- (a-hb)/(c*d) is 

LP X 

LP a 

/./' b 

PLUS x.a.b.c.d mean the 

y,y ^ fi'"?ie offsets for these 

^i,r,:- variables. 

MUH 

DIV 
MINUS 

By the lime MULT is encountered, argvcc contains 

> LOCAL X. 

arg3 ~> AC 

arg2 ~> /.0C/</. c 

aigl -> /.OO^/. d 

and the code 

LOA a.?. 
LDA 3 b.2 

has been generated. 



Now code must be generated for multiplying c and d; this will require using both AG's, so a 
temp is needed for arjj.l Cells in the frame are used for such temps; pos!;ug3 contains the 
offset of the temp for arj;.!. 

Assume that tios!ar£;3 = 11 (See belov/ for how pos gets set) 
So the MULT will generate: 

STA <n,2 

LDA c,2 

LDA 1 d.2 

JSR multiply routine //leaves result in ACl 

When the store into temp gets generated, the descriptor for arg3 is changed to 
LOCAL loc " 11. So after the multiply popped arg2 and argl, and pushes the result 
descriptor AC, 1, argvec contains 

arg3 -> LOCAL x 
ar«j2 -> LOCAL II 
argl -* AC 1 



Now DIV must load arg2 into AGO before doing the division; since arg2 is LOCAL, 
this is done by 

LDA 11.2 

Then the rest of the code is generated by DJV and MINUS: 
JSR divide subroutine //leaves result in AGO 

LDA 3 X.2 
SUB 3.0 

and the final argvec is 

arg3 -> ? 
arg2 -> ? 
argl -> AC 0. 

Any operand in argvec may need to be stored in a tcnip at any time, so a frame word is 
reserved for each operand which is stacked up in argvec. Frame space is, of course, also 
used for dynamic variables; this variable space is allocated and do-allocatcd according to 
the lexical scope of declarations. 

let f(a,b,c,d) be 
Pel X = - - - 

pet p.q = - ■- - 

let y = " - " 

X t! x~ (a+b)/(c*d) //same expression as in the 

//example above. 



The frame for this procedure looks like 



I fraincheader 
2 

3 

4 a 

5 b 

6 c 

7 d 

8 X 

10 q 

II - temps which might be needed 
12 - for processing x-(a+b)/(c*d) 



In the inner block, words 9h 10 jire reserved for p and cj. When this block is exited 
they are unreserved. Then word 9 is reserved for y. So at the bei^inninr; of the final 
assignment slaten^enl, the first unused word of the frame is word 10; 'this is where 
temps needed for this expression will be allocated. When the MULT ocodc operator 
IS encountered, arj;vcc will contain: 



— > 


LOCAL 


8 





10 


arg3-> 


AC- 








11 


arg2~> 


LOCAL 


6 





12 


argI-> 


LOCAL 


7 





13 



type loc ref pos 

//x 

//(atb) 
//c 
//d. 

So word 11 of the frame will be used as the temp for saving arg3. 

An aside on frame size cletermiiiaiion: 

The frame size needed for this procedure would be (at least) 14 words since 
word /.? of the frame was reserved as a temp for 'V/" in computing 
x~(a-f-b)/(c*d )". This is the case even if, as in the above, that word is never 
actually needed. (The reason is that by the time NCG finds out that it is not 
needed, if is too late to do anything about it; TRN determines how the frame space 
is to he allocated.) 

If a procedure allocates vector space ("lev v - vec 100" ) the variable ("v") is 
allocated atuong the other dynamic variables and temps; but the vector space (the 
101 words that "v" v.'ill point to) is allocated below the last temp word. reserved 
So if tlie declaration of "y" in the above example were "let v = vec 100", "v" would 
still be word 9, and would point to word 14 of the frame at run-time. The frame 
size would be 115 words (14+101). 



SSP and an-jvec Handling \n N'CG 



SSP is the global NCG variable which keeps track of allocation of words in the procedure 
frame. The value of SSP is the offset of tne first unused frame word relative to v/ord of 
the frame. Thus SSP alway.s has the value po.s!ar<;l+L 



SSP is initialized by case ENTRY: of the procedure Scanpurcs in BNCG2. (The nNTRY 
Dcode item is followed by a SAVE, whose 2-byle parameter is the initial value of SSP). 
hiitstack [in BNCGf} is called, which sets up ar'{^vec and initializes SSP to 4 -i- (numberof 



SSI' 
ocodt 

I' . . . 

formal parameters to the procedure). At this point, ar;;vcc has three dummy entries, 
corresponding to arg3. arj^Z. and argl; each is LOCAU with ioc, and pos fields set to SSP-3. 
SSP-2. SSi>-l respectively. 



Push (type, ref, Ioc) pushes an entry onto nrgvec and bumps SSP. Popl() and Poi)2() pops 
one or tv/o entries and decrements SSP. 

SGtSSP(newvakie) sets SSI* to a new val.ue. If the new value is smaller than the current 
value, it does the appropriate number of pops. If the new value is greater than the current 
value, it pushes tne appropriate number of temp cell descriptors. 

Clcarsfacl<(SSPvalue) generates code lo stofe into temps every operand below the SSPvalue. 



CGmeiDief (op, jiry) fin lJiNC:G43 



mmmimmsmim 

op may be: 

JMP. JSR. ISZ, DSZ 

J-DAO. LDAl. LDA2. LDA3 

tfv , • ^^> "; r^'^'^ !• STA 2. STA 3 

(that IS. for LDA-hSIA. the AG field is part of op) 

nfpl"->n.-/'°ir.S ^T.'' ^''^-'y^'^ f'Perand descriptor, usually in ar..vec fie 



reference) 



CO__mem ref _compi les jhc_f ol 1 owiris_ cod 



X 

x!n 

@(x!n) 
@x 

(@x)!n 
@((@x)!n) 



'r>'pe!ar<; Ioc!ar<; 

WCAL p 

LOCAL p 

AOC/i/. p 

RVLOCAL p 

RVLOCAL p 

IIVLOCAL p 



(s is a static) 



s!n 
@(s!n) 



@s 

(@s)!n 

@((@s)!n) 



LABEL 
LABEL 
LABEL 



liVLABHL 
RVLABEL 

RVLABEL 



rcf.'arj,' 

^ 

//40000+n 
#!40000+n 

#40000-1-11 

#140000-n) 





#40000-!-n 
# 140000 H) 





#40000i-n 

#14000a+ri 



code 

op n.2 

LDA 3 p.2; op .1.3 

LDA 3 p.2; op @n.3 

op @p. 2 

LDA 3@p.2; op n.3 

LDA 3@p.2: op @n.3 
LP IS the offset of 

tlie word in the 

frame.j 



op @.+N 

LDA 3@.-hN: op n.3 
LDA 3@.(-N; op @ n,3 
[.-i-N contains the 

address of the 

staticj 
op ©.-i-N 



LDA 3 @.-i-N: 



oi> n.3 



LDA 3 @.-f-N; op@n. 3 
L.-t-N contains the 
address of the 
static + #100000] 



Typelary loc!;irj; 
(z is a page-zero static) 



rcV.nrg 



code 



7. 


COMMON I 





op a 


z!n 


COMMON 1 


#40000-i-n 


LDA 3 a ; op n.3 


@(/.!n) 


COMMON 1 


#140000-^n 


LDA 3 a ; op @n,3 


@/. 


nVCOMMON 1 





o> @ a 

6:)A 3 ©a :op n,3 


(@/.)!n 


RVCOMMON 1 


#40000+n 


@((©z)!n) 


RVCOMMON 1 


#140000-Hi 


LDA 3 @a :op @n.3 
[a is t!ie address 
of the static in 






• 


page zero.] 



(k is a mimbcT. or a manifest name) 



k 


NUMBER k 





k!n 


NUMBER k 


#40000i-n 


@(k!n) 


NUMBER k 


#40000+0 



op .+N 

LDA 3 .+N; op n,3 
LDA 3 .+ N; op@n.3 
[.+ N contains 
the value k.] 



Iv s 

(lvs)!n 
@((lv s)!n) 



LV LABEL 1 





LVCOMMON 1 





LVIJBEL \ 





LVCOMMON 1 


//40000-Kn 


LV LABEL 1 


#140000+n 



op .+N 

LDA 3 .+N ; op n,3 

LDA 3 .+N : op @n.3 
[.+ N contains the 
.address of the static 
or page zero static.} 



CGmcnircf handles RVNUMBER specially if reflnrj', is 0. 

L If the vahie of the constant is between and #377: 

@k RVNUMBER k op k [i.e.. like 

a page zero 
■ reference} 

2. If the value of the constant is between #100000 and #100177. and not if compiling for 
Alto. 



@k 



Otherwise, 



RVNUMBER k 



RVNUMBER k 
RVNUMBER k 
RVNUMBER k 









#40000hi 
#140000 i-n 



op @ (k S#377) 
[i.e.. in indirect 
reference through 
a page zero location. 



op @ .+ N 

LDA 3 @.+N; op n.3 
LDA 3 @.+N; op@n.3 
[.-iN contains the 
value k} 



. Note: 

The onerand t^'pe RVLAUEL is never generated if compiling for the Alto fbecni.se 

op ^-vH" ^,^^c^rt ".-.-N" contains #1 + address of static docs nrndo a 

'nlr? '.""'""'"'"'"• '' '^ ^'°" ^" ^''^ ^'°^''>- '"Stead. "@ s'\ where s is? static. 

LDA 3. @.+N //.+N contains addr of s. 
op 0.3 

The more complex cases ( "(@s)!n" and "©©sin)") are done by 

LDA 3 @.+N 

STA 3 temp.2 • . 

and setting the urg io liVLOCAU temp. <ref> 
c£vjrru;nSi™0)rS, "rteg,."" """"^'^ """"^^ or operands «„ ..rs^c are 
CGivO applies the "rv" operator to aigK 
CGMd)scr(i) essentially does (arK2!(arj',l+j)). where j is a number between #-200 and 




iiSesl'mc^f^'SpSlSuir'"'"" '""'"^ '" '^'^''' ^'" '\ -"^'^'--^blc amount of 

CGniernref (op.arj;) [in nh'CG'l'} '' 

This routine .generates the memory reference instruction op for the operand described bv 
ar^ t compiles any preliminary code needed to set up tlie reference (e.g.. loadin- ACS 
vv.th the base address /or a subscript reference) computes the address (indnect. inde^x and 
displacement fields) for tite instruction, and finally generates (op V address) 

op may be: 



JMP. JSR. ISZ. DSZ. LDS 0. LDS 1. LDA 3. STA 0. SPA 1. STA 2. STA 3 
(That is. for LDA + STA. the AC field is part of op.) " 



ar{; is: 



a pointer to a five-word operand descriptor, usually in ar"vcc (i e ar-1 
ar};2, :ii<'..3) I he descriptor may not be of type A(\ it may be of ivpe M 
only It rct!ar<; is non-zero. (That is, arj; must describe a memory reference) 



CC'iaemref conipiles the following code: 
typelfirg lodarg reflarg 



code 



X 

x!n 

@(x!n) 

@x 

(@x)!n 

©((@x)!) 



@p,2;op@n.3 



LOCAL 

LOCAL 

LOCAL 

RVLOCAL 

RVLOCAL 

RVLOCAL 



P 
P 



P 
P 
P 





#40000+n 
#40000+n 

y"/0 

#40000+n 
#40000+n 



op p,2 

" "")A : 



LD/ 
LDA 



op @p,2 
LDv ^ 



P.2 
P.2 



opn,3 
op@n.3 



>A 3 @p.2:op@n,3 
LDA 3 

[p is the offset of the 
word in the framej 



(s is a static) 



s 

s!n 
@(s!n) 



(@s)!n 
@((As)!n) 



LABEL 
LABEL 
LABEL 



RVLABEL 1 
RVLABEL J 
RVLABEL 1 



(z is a pure zero static) 





#40000+n 
MOOOO+n 





#40000+n 
#40000+n 



z 


COMMON 1 





z!n 


COMMON 1 


#40000+n 


@(z!n) 


COMMON I 


#40000 +n 


@7. 


RVCOMMON 1 





(@z)!N 


RVCOMMON I 


#40000+n 


@((@x)!n 


RVCOMMON 1 


#40000+n 



(k is a number, or a something name) 



k 

k!n 

@(k!n) 


NUMBER 
NUMBER 
NUMBER 


k 
k 
k 




#40000+n 
#40000+ n 


!v s 


LVLABEL 

LVCOMMON 


I 
I 






(Iv s)!a 


LV LABEL 
LVCOMMON 


I 

1 


#40000+n 
#40000+n 


@(iv s)ln) 


LVLABEL 
LVCOMMON 


1 

I 


#140000+n 
#140000+n 



op @.+N 

LDA 3 @.+N: op n.3 

LDA 3 ©.+ N; op @n.3 

[.+ N contains the 

address of the above] 

op @.-i-N 

LDA 3 @.+N; op n.3 

LDA 3 @.+ N; op@n.3 

[.+ N contains the 

address of the static 

+ #1000001 



op a 
LDA 3 
LDA 3 
op@a 
LDA 3 
LDA 3 



a ; op n,3 
a ; op @n.3 



©a; op n.3 
@a ; op@n.3 
[a is the address of 
the static in page zero] 



op .+N 

LDA 3 .+N; op n.3 
LDA 3 .+n ; op @n.3 
|[.+ N contains trie 
value k] 

op .-i-N 



LDA 3 .+N ; op n,3 



LDA 3.+N : op@n.3 
f.+ N contains the address 
of tlie static or page-zero 
static.} 



CGmcmrcf handles RVNUMBER specially if rcflarg is 0. 



(1) if the v.'tliie of th constant is between and #377: 

@ k RVNUMBER k op k 

[i.e., like a page- 
zero staticj 

(2) if the value of the constant is between #100000 and //100377. and if not compiling 

@ k RVNUMBER k op @ (k S #377) 

[i.e., in indirect reference throiigli a page-zero location."} 
Otherwise, 

RN NUMBER k oi> @ +N 

rt/SSfi'f(^ } #'10000^n 1J)A 3" @.-,N ; op n.3 

hVlsUMhhR k #140000f-n LDA d @.+ N ; op @n,3 

[.+ N contains the 
value kj 



Note: The opertlnd type RVLABLL is never generated if compiling for the Alio (because 
.op @.-.-N". where ;>N" contains 'V/IOOOOO + address of static" does not do a niiilti n!e 
indirection, as it does on the Nova). Instead, "©s", wliere s is a static, generates 



LDA 3. @,+N y/.+ N contains addr of s. 
op 0,3 



The more complex cases ("@s)!n" and "@@s!n)") are done by 



LDA 3 @.-i-N 
STA 3 temp. 2 



and setting the ary to RVLOCAl^ temp.<rer>. 

The principal routines which manipidate tlie addressing modes of operands on nrj-vec are 
CGrv() and CCsuSjscr(j), both in BNCG-1. 

CG'rvO applies the "rv" operator to argl. 

CGsiibserO) essentially does (:iig2!(ar<;I i-j)), where j is a number between #-200 and 



I he PLUS Ocode operator also manipulates addressinc^ modes (CGphis in BNCG7) It looks 
ahead al the next operator; if a VECAP, STVECAP, Vi^QUAU or STWtWAL operator is next' 
It irie.s to do tilings in an order which will generate reasonable code. IIu.. CGuhts does 
"(\-i-m as if it had been written ••(v+i)!l\ "* '"'■'»""^ ^'"^^ 



Also, the structure code generation routines in BNCG8 do a considerable amount of 
address- mode manipulation. 



